Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition


Abstract

In the deployment of scene-text spotting systems on mobile platforms, lightweight models with low computation are preferable. In concept, end-to-end (E2E) text spotting is suitable for such purposes because it performs text detection and recognition in a single model. However, current state-of-the-art E2E methods rely on heavy feature extractors, recurrent sequence modellings, and complex shape aligners to pursue accuracy, which means their computations are still heavy. We explore the opposite direction: How far can we go without bells and whistles in E2E text spotting? To this end, we propose a text-spotting method that consists of simple convolutions and a few post-processes, named Context-Free TextSpotter. Experiments using standard benchmarks show that Context-Free TextSpotter achieves real-time text spotting on a GPU with only three million parameters, which is the smallest and fastest among existing deep text spotters, with an acceptable transcription quality degradation compared to heavier ones. Further, we demonstrate that our text spotter can run on a smartphone with affordable latency, which is valuable for building stand-alone OCR applications.


Similar resources

End-To-End Face Detection and Recognition

Plenty of face detection and recognition methods have been proposed and got delightful results in decades. Common face recognition pipeline consists of: 1) face detection, 2) face alignment, 3) feature extraction, 4) similarity calculation, which are separated and independent from each other. The separated face analyzing stages lead the model redundant calculation and are hard for end-to-end tr...


SEE: Towards Semi-Supervised End-to-End Scene Text Recognition

Detecting and recognizing text in natural scene images is a challenging, yet not completely solved task. In recent years several new systems that try to solve at least one of the two sub-tasks (text detection and text recognition) have been proposed. In this paper we present SEE, a step towards semi-supervised neural networks for scene text detection and recognition, that can be optimized end-t...


End-to-End Text Recognition with Hybrid HMM Maxout Models

The problem of detecting and recognizing text in natural scenes has proved to be more challenging than its counterpart in documents, with most of the previous work focusing on a single part of the problem. In this work, we propose new solutions to the character and word recognition problems and then show how to combine these solutions in an end-to-end text-recognition system. We do so by levera...


End-to-end Window-Constrained Scheduling for Real-Time Communication

This paper extends our original work on window-constrained scheduling, to address the problem of meeting end-to-end service guarantees across a sequence of servers. We describe an algorithm, called Multi-hop Virtual Deadline Scheduling (MVDS), that attempts to minimize end-to-end window-constraint violations, while maximizing link utilization for a series of real-time streams. The challenge pos...


JEJUNAL EVERSION MUCOSECTOMY AND INVAGINATION: AN INNOVATIVE TECHNIQUE FOR THE END TO END PANCREATICOJEJUNOSTOMY

 ABSTRACT Background: The pancreatojejunostomy has notoriously been known to carry a high rate of operative complications, morbidity and mortality, mainly due to anastomotic leak and ensuing septic complications. Objective: In order to decrease anastomotic leak and its attendant morbidity and mortality in operations requiring a pancreato-jejunal anastomosis, and also in order to simplify the op...



Journal

Journal title: Lecture Notes in Computer Science

Year: 2021

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-030-86331-9_16